[57] ABSTRACT

Method and device for estimating motion between segmented
images, or partitions, composed of a number I of regions R_i.
For performing the method, this device comprises a stage (41)
for initializing parameters which describe the motion of each
region R_i, in which one of different motion hypotheses chosen
a priori is selected after prediction, computation of the
prediction error and selection of the motion hypothesis to
which the smallest prediction error corresponds. This stage
(41) is followed by a stage (42) for intermediate processing by
filtering, and a stage (43) for refining the motion parameters,
which is looped to the input of the stage (42) for an iterative
operation until it ends at a given criterion so as to finally
obtain I motion information M_i(t) corresponding to the
regions R_i.

11 Claims, 3 Drawing Sheets

METHOD AND DEVICE FOR ESTIMATING MOTION BETWEEN IMAGES, SYSTEM FOR
ENCODING SEGMENTED IMAGES

This is a continuation of application Ser. No. 08/738,987 filed
Oct. 24, 1996.

BACKGROUND OF THE INVENTION

The invention relates to a method of estimating motion between
images forming a sequence P(t-n), P(t-n+1), . . . , P(t-2),
P(t-1), P(t), . . . , corresponding to a sequence S(t-n),
S(t-n+1), . . . , S(t-2), S(t-1), S(t), . . . , of segmented
images, or partitions, composed of I regions R_i identified by
labels, and to a corresponding device for motion estimation. The
invention also relates to a system of encoding segmented images
by means of this method and device.

The invention is particularly suitable for encoding video signals
in the field of very small bitrates and in the field of small
bitrates up to approximately 1 Mbit/second. This range of
bitrates notably corresponds to consumer applications, often
termed as multimedia applications.

For ten to fifteen years, the compression of stationary or
animated images has become a major industrial technological
feature progressively covering numerous sectors: digital
television, data storage, telemonitoring, videophone systems.
However, other applications are currently coming up such as, for
example, multimedia applications based on local data networks,
the transmission of images to mobile systems, or videophone
systems for switched telephone networks, constituting as many new
challenges. While maintaining an equal image quality, these
applications, which are based on media used at a lower cost
because of their reduced passband, require compression rates
which are higher than those used within the framework of the
major image encoding standards such as H261, JPEG, MPEG1 or
MPEG2. Moreover, services which are proposed for these media
enable users to interact on the contents of the video images,
i.e. to have direct access to the different constituent objects
in order to manipulate them. Several methods complying with these
requirements of compression and interactivity will certainly be
contradictory within the framework of the standardization
procedures for image encoding currently carried out by the MPEG
committee (Moving Picture Experts Group) of the ISO
(International Standardization Organization) for finalizing the
future standard MPEG4 by 1998.

Whatever the method which will then be used, the necessity of
compressing animated images requires efficient methods of
compensating motion of these images and thus a pre-estimation of
these motions. On the other hand, the necessity of being able to
interact on the image contents requires a representation which is
referred to as "object" of different elements of each of these
images.

In a sequence of images, a conventional method of motion
estimation between two of these images (referred to as previous
and subsequent images) consists of subdividing each of these
images into a bidimensional network of adjacent elementary blocks
of equal dimensions and of applying the block matching method,
hereinafter referred to as BMA (Block Matching Algorithm), which
is described, for example in the article "A VLSI architecture for
hierarchical motion estimation" in the magazine "IEEE
Transactions on Consumer Electronics", Vol. 41, No. 2, May 1995,
pp. 248-257. This technique supposes that the blocks are
sufficiently small (composed of, for example 16x16 pixels, which
is a non-limitative example) so that the motion of each of these
images can be considered as simple translations parallel to the
plane of the image, while all the pixels of a block are supposed
to have the same motion. Thus, a block of the subsequent image
may be compared with blocks occupying the same position in the
previous image or neighboring positions bounded by a search
window so as to select that block from these previous blocks of a
defined number which most resembles the reference block of the
subsequent image. The relative position of the selected block and
of the reference block defines a motion vector indicating the
translation from one block to the other between the previous
image and the subsequent image.

Such comparison operations, which are repeated for all the blocks
of the subsequent image, associate a field of motion vectors with
the blocks. When the information corresponding to the pixels of a
block is to be subsequently encoded and then transmitted and/or
stored, it is sufficient to encode and then transmit and/or store
the corresponding motion vectors instead: based on the block
selected in the previous image, these vectors provide information
about the new position of the block after its displacement in the
subsequent image under consideration.

However, although this technique is also suitable for the
applications mentioned hereinbefore, it has the following
drawback, namely, the images are perceived as bidimensional
signals without the effective contents of these images being
taken into account: there is normally no reason that the contours
of the elementary blocks and those of the objects which are
really present in the scenes coincide. The block-matching method
thus leads to degradation of the images, for example, when the
boundary between two objects of an image which are each moving in
a distinct manner is in the middle of a block. In this case, the
motion estimation is no longer reliable and the restored images
have a poor quality. When, in contrast, a region having a large
surface area and a homogeneous motion comprises numerous blocks
which are each associated with the same information, the
resultant excessive redundance of information is also detrimental
to the effectiveness of the encoding operation.

It should be noted that the motions between one image and another
may be naturally very different. The local motion of objects,
which may be assimilated with a translation in numerous cases, is
often superimposed on motions of the pick-up camera, such as zoom
(motion at a fixed or variable focus along an axis perpendicular
or transversal to the plane of the images) and panning motions
(motions of rotation at a sufficiently small angle around an axis
which is substantially parallel to the plane of the images). If
the local analysis of the motion has contributed, for example, to
the determination of a field of motion vectors satisfactorily
describing the translations between the blocks of one image and
another, these global zoom and/or panning motions will disperse
the field of vectors. A global analysis of the motions must thus
be effected simultaneously, but the number of parameters
describing all these motions becomes increasingly important.

In the envisaged applications, these restrictions of the BMA
block-matching method have led to the development of other
techniques which are based on a specific analysis of the image
and on a better comprehension of its structure. This analysis
consists of considering an image as the projection of a
three-dimensional scene comprising stationary and animated
objects and of trying to identify these different objects in each
image and then estimate their representative parameters (which
are related, for example to their shape, color, texture, motion,
etc.), i.e. in a more general manner so as to define a
segmentation of images into regions R_i which are both individual
and homogeneous with respect to a given criterion.

The document "Region-based motion analysis for video 
encoding at low bitrates" by H. Sanson, CCETT, pp. 1-8, 
published by ISO under the reference ISO/IEC-JTC1/
SC29/WG11/MPEG94 in March 1994, describes a method 
with which both a segmentation of images of a video 
sequence into regions homogeneous with respect to motion 
and a satisfactory estimation of the parameters describing 
the motions in these regions can be performed. However, 
this method only seems to be appropriate in situations where 
no information relating to the image contents is available. 
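
By way of non-limitative illustration, the conventional block matching (BMA) search recalled in this background may be sketched as follows. The sketch is not taken from the cited article: the function name `block_match`, the use of a sum of absolute differences as resemblance measure, and the representation of an image as a list of rows of luminance values are assumptions made here for clarity. The sign convention follows the relation L(x,y,t)=L(x-Dx, y-Dy, t-1) used in this document.

```python
def block_match(prev, curr, bx, by, size, search):
    """Exhaustive block matching (illustrative sketch).

    Looks for the displacement (dx, dy) such that the block of `curr`
    whose top-left corner is (bx, by) best resembles the block of `prev`
    whose top-left corner is (bx - dx, by - dy). The resemblance measure
    is the sum of absolute differences (an assumption of this sketch).
    Returns (dx, dy, best_cost).
    """
    height, width = len(prev), len(prev[0])
    best = None
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            px, py = bx - dx, by - dy
            # Skip candidate blocks that leave the previous image.
            if px < 0 or py < 0 or px + size > width or py + size > height:
                continue
            cost = 0
            for j in range(size):
                for i in range(size):
                    cost += abs(curr[by + j][bx + i] - prev[py + j][px + i])
            if best is None or cost < best[0]:
                best = (cost, dx, dy)
    return best[1], best[2], best[0]
```

Repeating this search for every block of the subsequent image yields the field of motion vectors mentioned above; the search window half-width `search` bounds the cost of the exhaustive comparison.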

SUMMARY OF THE INVENTION 

It is a first object of the invention to propose a motion 
estimation method based, in contrast, on a previous knowl- 
edge of these image contents, which knowledge is obtained 
during an analysis step which has led to the segmentation of 
these images. 

To this end, the invention relates to a method of estimating
motion between images as defined hereinbefore, which method
successively comprises, for supplying, per region R_i and in the
form of a motion vector defined by motion parameters, information
M_i(t) which is representative of the motion of the current image
P(t) with respect to a previous image P(t-1), the following three
operations each performed for each region of the current image:

(1) a first step of initializing the motion parameters of each
region R_i of the current image P(t) as a function of the
images before segmentation P(t-1) and P(t), segmented images
S(t-1) and S(t), and motion information M_i(t-1) estimated for
the previous current image P(t-1) in a previous performance of
the method;

(2) a second step of intermediate processing of the images on
which the estimation of the motion and of the parameters of
this motion is performed, and a third step of refining the
motion parameters for the definitive determination of said
parameters in the form of a vector (Dx, Dy) for all the pixels
of the image of each of said regions R_i in such a way that for
each coordinate point (x,y) of the region,
L(x,y,t)=L(x-Dx, y-Dy, t-1), L(.) designating the luminance or
a video signal which is more complex, such as a combination of
chrominance and luminance signals, and Dx, Dy being polynomials
the degree of which is related to the type of motion of the
region;

(3) the iterative repetition of said second and third steps of
intermediate processing and refinement until the end of this
iterative process as a function of at least a given criterion
so as to finally obtain the I motion information M_i(t).

The motion estimation method thus proposed is advanta- 
geous in several respects. On the one hand, the required 
execution time to obtain a given estimation quality is, on 
average, shorter when the estimation process starts from 
values which are closer to those constituting the definitive 
result of the estimation, and the additional time required to 
effect this initialization step is negligible with respect to the 
time for estimation itself. Moreover, this initialization accel- 
erates the convergence of the proposed iterative process 
because the estimation method is supposed to provide only 
small relative variations of the motion parameters. 

In accordance with an advantageous detail of the invention, said
first step of initializing the motion parameters comprises the
following sub-steps:

(A) a first sub-step of defining motion hypotheses for each
region R_i considered;

(B) a second sub-step of predicting, on the basis of the image
P(t-1), the region corresponding to each region R_i in the
image P(t), taking each of the previous motion hypotheses into
account, and of computing the corresponding prediction errors;

(C) a third sub-step of comparing said computed prediction errors
for each region R_i considered and of selecting, as an initial
motion of this region, that motion hypothesis which corresponds
to the smallest prediction error.
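
By way of non-limitative illustration, sub-steps (B) and (C) may be sketched as follows for one region. The sketch makes several simplifying assumptions that are not part of the text: the motion hypotheses are restricted to integer translations (dx, dy), the prediction error is taken as a sum of squared luminance differences, pixels whose predicted source falls outside the previous image are simply skipped, and the names `prediction_error` and `select_initial_motion` are hypothetical.

```python
def prediction_error(prev, curr, labels, region, dx, dy):
    """Sum of squared differences L(x,y,t) - L(x-dx, y-dy, t-1) over the
    pixels carrying the label `region` in the label image `labels`.
    Pixels whose source lies outside `prev` are skipped (assumption)."""
    height, width = len(curr), len(curr[0])
    err = 0
    for y in range(height):
        for x in range(width):
            if labels[y][x] != region:
                continue
            sx, sy = x - dx, y - dy
            if 0 <= sx < width and 0 <= sy < height:
                err += (curr[y][x] - prev[sy][sx]) ** 2
    return err


def select_initial_motion(prev, curr, labels, region, hypotheses):
    """Sub-step (B): compute the prediction error of every hypothesis.
    Sub-step (C): keep the hypothesis with the smallest error.
    Returns (best_hypothesis, best_error)."""
    errors = [prediction_error(prev, curr, labels, region, dx, dy)
              for dx, dy in hypotheses]
    best = min(range(len(hypotheses)), key=errors.__getitem__)
    return hypotheses[best], errors[best]
```

The selected hypothesis only serves as a starting point; the subsequent refinement step is what produces the definitive parameters.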

In a preferred embodiment of the method, said second step of
intermediate processing comprises a filtering preprocessing
operation by means of an isotropic Gaussian filter, while for the
determination of the vector (Dx, Dy) associated with each region
R_i considered, the third refinement step comprises an operation
of minimizing the sum of the square values of the prediction
errors for the pixels of the image of said region R_i or a
sub-set of these pixels, this sum being written as:
$\sum_{(x,y)} \bigl(L(x,y,t) - L(x-Dx,\, y-Dy,\, t-1)\bigr)^2$,
said minimization being effected, for example, by means of the
Gauss-Newton approximation method, by effecting a first-order
limited development of said sum after replacing Dx and Dy by
(Dx_0+dx) and (Dy_0+dy), respectively, by making equal to 0 the
derivatives of said limited development with respect to each
coefficient of dx and dy, and by retaining, as variation values
of the motion parameters leading to the smallest quadratic error,
the solutions of the set of n equations with n unknown quantities
obtained subsequent to these operations.
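
By way of non-limitative illustration, the Gauss-Newton minimization may be sketched for the simplest motion type, a pure translation, for which n=2 and the unknowns are the two variations dx and dy. Everything beyond the text is an assumption of this sketch: bilinear interpolation for non-integer positions, central differences for the luminance gradients, clamping of coordinates to the image, a fixed number of iterations as stopping criterion, and the names `sample` and `refine_translation`.

```python
import math


def sample(img, x, y):
    """Bilinear interpolation of `img` (list of rows) at real-valued
    (x, y); coordinates are clamped to the image (assumption)."""
    h, w = len(img), len(img[0])
    x = min(max(x, 0.0), w - 1.0)
    y = min(max(y, 0.0), h - 1.0)
    x0 = min(int(math.floor(x)), w - 2)
    y0 = min(int(math.floor(y)), h - 2)
    fx, fy = x - x0, y - y0
    top = (1 - fx) * img[y0][x0] + fx * img[y0][x0 + 1]
    bot = (1 - fx) * img[y0 + 1][x0] + fx * img[y0 + 1][x0 + 1]
    return (1 - fy) * top + fy * bot


def refine_translation(prev, curr, pixels, dx0=0.0, dy0=0.0, iterations=20):
    """Gauss-Newton refinement of a translation (Dx, Dy) over `pixels`,
    a list of integer (x, y) coordinates of one region."""
    dx_est, dy_est = dx0, dy0
    for _ in range(iterations):
        sxx = sxy = syy = bx = by = 0.0
        for x, y in pixels:
            px, py = x - dx_est, y - dy_est
            # Displaced frame difference at the current estimate.
            dfd = curr[y][x] - sample(prev, px, py)
            # Gradients of the previous image at the displaced position.
            gx = 0.5 * (sample(prev, px + 1, py) - sample(prev, px - 1, py))
            gy = 0.5 * (sample(prev, px, py + 1) - sample(prev, px, py - 1))
            sxx += gx * gx
            sxy += gx * gy
            syy += gy * gy
            bx -= dfd * gx
            by -= dfd * gy
        det = sxx * syy - sxy * sxy
        if abs(det) < 1e-12:
            break  # gradients do not constrain both unknowns
        # Solve the 2x2 set of normal equations for the variations.
        dx_est += (syy * bx - sxy * by) / det
        dy_est += (sxx * by - sxy * bx) / det
    return dx_est, dy_est
```

Each pass linearizes the luminance around the current estimate, sets the derivatives of the squared sum with respect to dx and dy to zero, and solves the resulting two equations; polynomial models of higher degree lead to a larger set of equations of the same kind.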

In a modification of the method, said three operations of
initialization, intermediate processing and refinement may be
followed by an additional operation of controlled propagation to
the regions neighboring each region R_i in accordance with which
the following sub-steps are repeated for each region R_i:

(a) a new sub-step of predicting, on the basis of the image
P(t-1), each of said neighboring regions in the image P(t),
taking a motion hypothesis in these regions into account which
corresponds to that associated with the motion parameters
resulting from said definitive determination, and of computing
the corresponding additional prediction error;

(b) a sub-step of comparing said additional prediction error and
the prediction error associated with the motion hypothesis
retained as the initial motion of the neighboring region
considered, and of selecting, as motion of each of said
neighboring regions, that one to which the smallest of these
two prediction errors corresponds.

It is another object of the invention to provide a motion
estimation device with which this method can be carried out. To
this end, the invention relates to a device for estimating motion
between images forming a sequence P(t-n), P(t-n+1), . . . ,
P(t-2), P(t-1), P(t), . . . , corresponding to a sequence S(t-n),
S(t-n+1), . . . , S(t-2), S(t-1), S(t), . . . , of segmented
images, or partitions, composed of I regions R_i identified by
labels, which device is characterized in that, for supplying, per
region R_i and in the form of a motion vector defined by motion
parameters, information M_i(t) which is representative of the
motion of the current image P(t) with respect to a previous image
P(t-1), the device successively comprises the following three
stages, namely a first stage for initializing the motion
parameters of each region of the current image by selecting
motion hypotheses and by choosing, for each region, that motion
for which the sum of the square values of the prediction errors
in the pixels of the image of the region is minimal, a second
stage for intermediate processing, and a third stage for refining
the motion parameters by an approximated determination of the
vector (Dx,Dy) associated with each region, said second and third
stages being provided in a loop in accordance with an iterative
process until the end of this process as a function of a given
criterion.

Another object of the invention is to provide an encoding system
suitable for performing the method and the motion estimation
device in a system for encoding segmented images.

To this end, the invention relates to a system of encoding
segmented images, or partitions, corresponding to a sequence of
initial images in which said partitions are bounded by contours
and regions identified by associated labels, said system
comprising means for generating, for each successive partition
referred to as current partition, encoded signals corresponding
to a partition referred to as optimal partition and composed
either of regions extracted from a partition referred to as main
partition and determined directly by motion estimation and
compensation of a partition preceding said current partition, or
of regions extracted from additional partitions created by
merging or, in contrast, by resegmentation of regions of said
main partition, characterized in that, in view of said possible
merging, the system comprises at least a circuit for merging the
regions, comprising a motion estimation device as defined
hereinbefore and a device for proposing the merging of regions
concerned.

These and other aspects of the invention will be apparent from
and elucidated with reference to the embodiments described
hereinafter.

BRIEF DESCRIPTION OF THE DRAWING

In the drawings:

FIG. 1 shows an example of a segmented image, the segmentation
resulting from a pre-analysis of the original images (it will be
noted that the principle of segmentation itself, in whichever
form, does not form part of the invention);

FIG. 2 diagrammatically shows a motion between two successive
images P(t-1) and P(t);

FIG. 3 illustrates the steps of the estimation process according
to the invention, which is of an iterative nature;

FIG. 4 shows an embodiment of a motion estimation device for
performing the method;

FIG. 5 shows an embodiment, in a system of encoding segmented
images, of a succession of circuits for merging the regions
(here, two circuits) each comprising a motion estimation device
as shown in FIG. 4;

FIG. 6 shows, for one region, an example of a local reference,
with, for example, a coefficient 2 standardization.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

Before describing the invention in detail, the different
notations used will be defined herebelow, which notations will be
used throughout the description. The images considered form part
of an original sequence of textured images, denoted P(t-n),
P(t-n+1), . . . , P(t-2), P(t-1), P(t), etc. In the description
which follows the focus is principally on P(t), the current
image, and on P(t-1) which is the previous original image (or the
previous encoded image, in accordance with the characteristics of
the encoding process performed after the motion estimation). From
a practical point of view, these two textured images P(t-1) and
P(t), between which the motion estimation is effected at the
instant t, are the images of the luminance signal in this case,
but may also correspond either to a combination of luminance and
chrominance signals in the case where the color contains much
specific information, or to any other transform of original
images restituting the information of the signal. The value of
the luminance at an arbitrary point (x,y) in one of the images
P(t-2), P(t-1), P(t), etc. is denoted L(x,y,t-2), L(x,y,t-1),
L(x,y,t), etc. Concerning the operation of segmenting the images,
a sequence of images of labels (also referred to as partitions)
corresponds to the sequence of original images, and the segmented
images are referred to as S(t-1), S(t) and correspond to the
original textured images P(t-1), P(t) and finally form a sequence
of a certain type of images resulting from a pre-analysis which
is required to carry out the motion estimation method. FIG. 1
illustrates an example of segmenting an image into seven regions
R_i with i=0 to 6.



The information relating to the motion of the current image P(t)
with respect to the previous image P(t-1) is arranged under the
reference M_i(t) for the region R_i of the image P(t). This
reference M_i(t) includes the data constituted by the type of
motion retained (i.e. the order or degree of the polynomial
representing this motion) and the values of the corresponding
parameters (i.e. the values of the coefficients of the
polynomials). For example, as illustrated in FIG. 2, a
displacement vector (Dx,Dy) from P(t-1) to P(t), with Dx and Dy
being polynomials in x and y whose coefficients are the motion
parameters, corresponds to a type of motion determined between
the images P(t-1) and P(t) at a point (x,y) of a region R_i of
P(t). This can be written as L(x,y,t)=L(x-Dx,y-Dy,t-1). The
degree of these polynomials (0, 1 or 2) and the number of
coefficients defining them (from 2 to 12 parameters) depend on
the type of motion considered:

(a) two parameters are sufficient for defining the translation of
a plane facet parallel to the image in a plane parallel to the
image:

$Dx = a_1$

$Dy = a_2$

(b) for a motion of the zoom and/or panning type, four parameters
are necessary for modeling the motion of translation of a plane
facet parallel to the plane of the image if the facet has an
arbitrary orientation, or has an arbitrary translation motion
of this facet if it is parallel to the plane of the image:

$Dx = a_1 + a_2 x + a_3 y$

(c) for a related transform, six parameters are necessary for
modeling a translation motion as indicated under (b) above or a
rotational motion of a plane facet around an axis perpendicular
to the plane of the image:

$Dx = a_1 + a_2 x + a_3 y$

$Dy = a_4 + a_5 x + a_6 y$

(d) for a quadratic motion, twelve parameters are necessary for
modeling arbitrary rotations and translations of the curved
facets:
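
By way of non-limitative illustration, the evaluation of the displacement vector (Dx, Dy) from a set of motion parameters may be sketched as follows for the two-parameter translation and for a six-parameter first-degree model. The six-parameter form used here, Dx = a1 + a2·x + a3·y and Dy = a4 + a5·x + a6·y, is the usual first-degree polynomial and is stated as an assumption of the sketch, as are the function names `displacement` and `source_position`; the four- and twelve-parameter models are deliberately not sketched.

```python
def displacement(params, x, y):
    """Return (Dx, Dy) at the local coordinates (x, y).

    2 parameters: translation, Dx = a1, Dy = a2.
    6 parameters: first-degree model (assumed form),
                  Dx = a1 + a2*x + a3*y, Dy = a4 + a5*x + a6*y.
    """
    if len(params) == 2:
        a1, a2 = params
        return a1, a2
    if len(params) == 6:
        a1, a2, a3, a4, a5, a6 = params
        return a1 + a2 * x + a3 * y, a4 + a5 * x + a6 * y
    raise ValueError("only the 2- and 6-parameter models are sketched")


def source_position(params, x, y):
    """Position in P(t-1) whose luminance predicts pixel (x, y) of P(t),
    following L(x,y,t) = L(x-Dx, y-Dy, t-1)."""
    dx, dy = displacement(params, x, y)
    return x - dx, y - dy
```

Representing M_i(t) as a plain tuple whose length encodes the type of motion is only one possible choice; it keeps the type of motion and the parameter values together, as the reference M_i(t) does.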

These polynomial models have been adopted because it is possible
to show that they represent the motion of objects in a
satisfactory manner. However, they cannot be interpreted as a
strict description of the real three-dimensional motion of these
objects. For this purpose, it is necessary to have the certainty
that the objects are rigid and that their form is also known,
which is not the case. The models in question are thus simply a
representation of the deformation of the projection of the
objects in the image plane (for example, in the case of two
parameters, the model effectively describes a translation in the
image plane, supposing that the object concerned is rigid and has
a plane surface which is parallel to the image plane). A detailed
representation of these motion models is given, for example in
the article "Differential methods for the identification of 2D
and 3D motion models in image sequences", J. L. Dugelay and
H. Sanson, Signal Processing: Image Communication, Vol. 7, No. 1,
March 1995, pp. 105-127.

The coordinates of a point in an image are denoted throughout the
description in capitals (X,Y) or in lower case (x,y), dependent
on whether they are expressed with respect to a global reference
related only to the image or with respect to a local reference
related to a given region of the image.

As these notations have been recalled, it is useful to also
recall that, in this case, the objective of motion estimation is
to provide the possibility of subsequently restoring a predicted
image R(t) constituting an approximation of P(t), based on the
segmented image S(t), the previously restored image R(t-1)
corresponding to P(t-1), and the information M_i(t) obtained
during the motion estimation. Such a determination of R(t)
provides, inter alia, the subsequent possibility of encoding the
prediction error only, i.e. the difference between P(t) and R(t).
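
By way of non-limitative illustration, the restoration of a predicted image R(t) from R(t-1), S(t) and the motion information, followed by the computation of the prediction error, may be sketched as follows. The sketch assumes integer translations only, a dictionary `motions` mapping each label to its (dx, dy), and clamping of source coordinates to the frame; the names `predict_image` and `residual` are hypothetical.

```python
def predict_image(prev_restored, labels, motions):
    """Build R(t): each pixel takes the value of R(t-1) at the position
    displaced by the motion of the region it belongs to in S(t).

    `labels` is the label image S(t); `motions` maps a label to an
    integer translation (dx, dy) (assumption of this sketch).
    """
    h, w = len(labels), len(labels[0])
    pred = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            dx, dy = motions[labels[y][x]]
            # Clamp the source position to the frame (assumption).
            sx = min(max(x - dx, 0), w - 1)
            sy = min(max(y - dy, 0), h - 1)
            pred[y][x] = prev_restored[sy][sx]
    return pred


def residual(curr, pred):
    """Prediction error P(t) - R(t), pixel by pixel."""
    return [[c - p for c, p in zip(crow, prow)]
            for crow, prow in zip(curr, pred)]
```

When the motion information describes the scene well, the residual is close to zero almost everywhere, which is what makes encoding the prediction error alone attractive.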

The method according to the invention can now be described in
detail with reference to FIG. 3. It comprises successively a
first step 10 of initializing the motion parameters (INIT), a
second pre-processing step 20 (PPRO), and a third step 30 of
refining the motion parameters (REFT), each performed for each
region of the current image.

The first step 10 of initializing the motion parameters has for
its object to start the motion estimation process with motion
parameter values for each region R_i of the image P(t)
considered, which values are as close as possible to the final
real values of these parameters so as to be able to suppose,
throughout the processing operations envisaged, that the motion
parameters have small variations. Moreover, the required
execution time for obtaining a given estimation quality is, on
average, shorter if one starts the estimation process with
parameter values which are closer to the searched real values,
and the additional execution time of said first initialization
step is negligible with respect to that of the estimation itself.
It should also be prevented that, in the course of these
processing operations, which, as will be seen, are effected in an
iterative manner, the process converges to a local minimum, which
would be more likely if the initial values were too remote from
said searched real values.

In three sub-steps referenced (A), (B) and (C) and performed for
each region R_i of the image P(t), the first step thus consists
of:

(A) exploiting at best the information which is initially
available at the input, and possibly gathering new information
for deducing probable motion hypotheses;

(B) for each of said motion hypotheses (expressed hereinafter)
and based on known data for the image P(t-1), predicting the
region concerned in P(t) and computing the corresponding
prediction error;

(C) selecting, as initial values of the motion parameters, those
values which generate the smallest prediction error (which
simultaneously validates one of the motion hypotheses).

The first sub-step (A) of the step 10 of initialization INIT
consists of exploiting the initial data, which are:

(a) the original images P(t-1) and P(t);

(b) the images of the labels S(t-1) and S(t);

(c) the data M_i(t-1), i.e. the motion information (type of
motion and values of corresponding parameters), including the
case where the motion is zero, permitting, during the previous
cycle (i.e. based on the image P(t-2)) to know the motions
which lead to the image P(t-1);

(d) the data resulting from performing the BMA technique
(described above) between the images P(t-1) and P(t), namely a
displacement vector defined for each pixel of P(t), in general
to within about one pixel (or possibly about half a pixel).

For each region R_i of P(t), four motion hypotheses with respect
to the previous image have successively been taken into
consideration in this case, taking their complementarity and
simplicity of formulation into account in view of the available
information:

(1) the motion of the region R_i is zero;

(2) the label i considered already existed in S(t-1) and the
motion of the region is only a translation parallel to the
image plane: it is then sufficient to compute the coordinates
of the center of gravity of the label i in S(t-1) and S(t) and
then the difference between these coordinates, which yields the
displacement vector;

(3) the label i considered already existed in S(t-1): j
designating each label near i in S(t-1) and the data M_i(t-1)
and M_j(t-1) being expressed in the local reference associated
with i, all the labels j neighboring i in S(t-1) are searched
and then the corresponding data M_j(t) are read (type of motion
and values of corresponding parameters converted in this local
reference), and the best motion between the label i and its
neighboring labels is chosen;

(4) the motion obtained is that which corresponds to the best
approximation of the field of displacement vectors resulting
from an adaptation of the BMA technique per region (only the
displacements computed for blocks of more than half the number
included in the region i considered being taken into account),
said adaptation consisting of a sequence of translational
motion estimations of blocks having a variable size and of
relaxations so as to subsequently determine an approximation of
the field of vectors by means of a more complex model and with
the aid of a regression technique (this method of adapting the
polynomial from a set of values is similar to the polynomial
approximation method described, for example for encoding
textured images in the article "Coding of arbitrarily shaped
image segments based on a generalized orthogonal transform" by
M. Gilge, T. Engelhardt and R. Mehlan, in the magazine "Signal
Processing: Image Communication", Vol. 1, No. 2, October 1989,
pp. 153-180, which example is not limitative).

The second sub-step (B) of the step 10 for initialization INIT
consists of predicting, on the basis of P(t-1), the corresponding
region in P(t), taking into account each motion hypothesis
effected during the sub-step (A), and of subsequently computing
each time the prediction error for the region. The following
prediction principle is used: with P(t-1), S(t) and M_i(t) being
known, the predicted luminance value in the image P(t) is
determined at a position X,Y of a pixel. The detailed description
of the prediction will be given hereinafter in the part dealing
with the refinement, for the definitive estimation of the motion
parameters.

Finally, the third sub-step (C) of the step 10 for initialization
INIT consists of comparing the computed prediction errors in a
region and of preserving, as initial motion of the region, that
one to which the smallest prediction error corresponds. The
process is repeated for each region and the first initialization
step 10 is ended when the motion parameters have thus been
adjusted for all the regions of P(t). The set of initial
parameters thus determined for a region R_i is designated by the
reference M_i^12(t) in FIG. 3.

The second intermediate processing step 20 has for its object to
facilitate the estimation of definitive motion parameters
obtained at the end of the third and last step. Without this
being the only possibility, an essential processing operation for
obtaining this objective is to modify the luminance signal so as
to bring it closer to the theoretical ideal (a first-order
function) i.e. so as to verify the mathematical hypothesis
required by the theory in order to obtain a convergence of the
estimation process. This processing operation consists of a
filtering of P(t-1) and of P(t), for example by using an
isotropic Gaussian filter in the four directions of the plane
(S(t-1), S(t), M_i(t) are not modified). This choice of filter
ensures a very good compromise between a smoothing of the
contours, which is useful for simplifying the luminance signal in
the image and facilitating the convergence by avoiding the local
minima as much as possible, and the maintenance of a sufficient
localization of these contours in the image (it is desirable to
preserve enough details of the image in order that the precision
of the estimated motion is sufficient). The filtered images are
designated by the references P'(t-1) and P'(t) in FIG. 3. The set
of motion parameters available after this preprocessing operation
realized during step 20 is designated by the reference M_i^23(t)
in FIG. 3.

The third step 30 of refining the motion parameters, which step
is iterative, has for its object to effect the definitive
estimation of the motion parameters for the region concerned. The
iterative process performed during this step ends at a given
criterion, for example when a number of iterations [...]

[...] square values of the prediction errors in the pixels of the
region is minimized, i.e. the following expression:

$$\sum_{(x,y)} \bigl(L(x,y,t) - L(x-Dx,\, y-Dy,\, t-1)\bigr)^2 \qquad (1)$$

for x and y taking all the values corresponding to the
coordinates of the pixels in the region R_i considered. This
expression (1) is denoted in an abbreviated form in the following
manner (DFD=Displaced Frame Difference):

$$\sum_{(x,y)} \bigl(\mathrm{DFD}(x,y,Dx,Dy)\bigr)^2 \qquad (2)$$

It is known that such a mathematical minimizing operation (of
expression (2)) may be notably effected by means of the
Gauss-Newton method for which Dx=(Dx_0+dx) and Dy=(Dy_0+dy) with
dx and dy being very small with respect to Dx_0 and Dy_0,
respectively. By first-order development, we obtain:

$$L(x-Dx,\, y-Dy,\, t-1) = L(x-Dx_0,\, y-Dy_0,\, t-1) - dx\,\frac{\partial}{\partial x} L(x-Dx_0,\, y-Dy_0,\, t-1) - dy\,\frac{\partial}{\partial y} L(x-Dx_0,\, y-Dy_0,\, t-1) \qquad (3)$$

The expression (1) to be minimized thus will be:

$$\sum_{(x,y)} \Bigl[\mathrm{DFD}(x,y,Dx_0,Dy_0) + dx\,\frac{\partial}{\partial x} L(x-Dx_0,\, y-Dy_0,\, t-1) + dy\,\frac{\partial}{\partial y} L(x-Dx_0,\, y-Dy_0,\, t-1)\Bigr]^2 \qquad (4)$$

In this expression (4), the derivatives with respect to each of
the coefficients of dx and dy are made equal to 0 in order to
obtain the minimum of this expression with respect to the motion
parameters, which leads to a set of n equations with n unknown
quantities.

The solutions of this set of equations are the variations of the
motion parameters leading to the smallest quadratic error. To
resolve this, it is expressed in a matrix form:

$$[A]\,[x] = [B] \qquad (5)$$

The vector x represents the searched parameters and the terms of
the matrices depend on the coordinates of the pixels of the
current image, on the gradients which are horizontal and vertical
to the previous positions of the pixels (in the previous image)
and on the luminance values at these

fixed in advance is reached (other criteria may also be positions in the current and the previous image. For each 

proposed, such as a stop of iterations when a sufficient region R ( of S(t) and at each iteration, the matrices A and B 

quality during motion compensation permitting the previous must be constructed, the matrix A must be inverted and the 

motion estimation is reached or when the improvement for 55 inverse matrix thus obtained must be multiplied by the 

the new iteration becomes negligible, or even a combination matrix B to obtain the vector solution x: the motion infor- 

of several criteria). mation (for the region R ( ) may then be updated by adding 

First of all, it should be recalled that, for each region of components of this vector solution x to the previous expres- 

P(t), a vector (Dx,Dy) is searched so that, for each point in sion of this motion information. 

the region, L(x,y,t)=L(x-Dx,y-Dy,t-l), in which Dx and Dy 60 For constructing the matrix A, the luminance values must 

are polynomials of a degree related to the type of motion for be computed at the points in the previous image correspond- 

the region considered. The equality between these two terms ing to previous positions of the points in the current image, 

L(.) can only be realized in a more or less approximative of which the motion and the positions in this current image 

manner in accordance with the degree of quality of the are known — it is the prediction operation described below — 

motion estimation, but in order that this approximation is as 65 and the values of the horizontal gradient and the vertical 

satisfactory as possible, the criterion used is the one for gradient at similar points must be computed. The computa- 

determining the smallest quadratic error: the sum of the tions must be performed on values of parameters expressed 
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in their local reference (i.e. related to the region). For each 
region, two sets of parameters are stored, on the one hand, 
the parameters during estimation, while converging, which 
is denoted M^ v (t) 7 and, on the other hand, the parameters 
giving the best results for the region, which is denoted M/(t). 
At the start of the refinement step 30, these two motions are 
equal to the initial motion (M , 12 (0 m FIG. 3) originating 
from the initialization of the processed parameters. 
Subsequently, M/^t) is defined in an iterative manner and 
substituted at the end of each iteration for M/(t) which 
corresponds to the motion giving the best results among the 
following motions for the region under study: either the 
parameters M ( cv (0 correctly computed for the current 
region, or the parameters M/^t) of the regions j neighboring 
i in S(t), which motions are reconverted in the local refer- 
ence corresponding to the region i. Finally, this retained 
motion may give rise to a propagation towards the neigh- 
boring regions: for these neighboring regions, the search for 
the smallest prediction error on the basis of this motion is 
resumed, which is selected if it effectively leads to a smaller 
prediction error, and so forth. At the output of each iteration 
of the refinement step, the motion information determined 
for each region i of S(t) is sent towards the input of step 20 
(parameters designated by M 32 '(t) in FIG. 3) 
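
As a rough illustration of the Gauss-Newton refinement described above (first-order development, then the system [A].[x]=[B]), the following sketch computes one update for the simplest case of a purely translational motion. It is a sketch under stated assumptions, not the patented implementation: the function name `gauss_newton_step`, the integer starting displacement, the central-difference gradients (instead of the interpolated gradients of the text) and the closed-form 2x2 solution (instead of the Householder method) are all choices made here for brevity.

```python
def gauss_newton_step(cur, prev, region, d0):
    """One Gauss-Newton update for a translational motion (assumption).

    cur, prev: 2-D lists of luminance values indexed [y][x].
    region:    iterable of (x, y) pixels of the region in the current image.
    d0:        current integer estimate (Dx0, Dy0).
    Returns the increment (dx, dy), or None when the matrix A is singular.
    """
    dx0, dy0 = d0
    h, w = len(prev), len(prev[0])
    a11 = a12 = a22 = b1 = b2 = 0.0
    for x, y in region:
        px, py = x - dx0, y - dy0  # previous position of the pixel
        if not (1 <= px < w - 1 and 1 <= py < h - 1):
            continue  # no full neighbourhood for the gradient
        # Horizontal and vertical gradients in the previous image
        gx = (prev[py][px + 1] - prev[py][px - 1]) / 2.0
        gy = (prev[py + 1][px] - prev[py - 1][px]) / 2.0
        dfd = cur[y][x] - prev[py][px]  # DFD(x, y, Dx0, Dy0)
        # Accumulate the normal equations of sum (dfd + dx*gx + dy*gy)^2
        a11 += gx * gx
        a12 += gx * gy
        a22 += gy * gy
        b1 -= gx * dfd
        b2 -= gy * dfd
    det = a11 * a22 - a12 * a12
    if abs(det) < 1e-12:
        return None  # singular A: the motion is left unrefined
    return ((a22 * b1 - a12 * b2) / det, (a11 * b2 - a12 * b1) / det)
```

For a polynomial motion of higher degree, the same construction applies with one unknown per coefficient of dx and dy, so that A becomes an n x n matrix.
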

The prediction operation which is necessary for the construction of the
matrix A will now be described. Given a pixel of coordinates (X,Y), the
prediction enables determination of the predicted luminance value at
this position at the instant t, denoted L(X,Y,t), based on S(t), P(t-1)
and M(t). This operation, which is performed at each point of the
image, consists of two steps:

(a) computation of the non-integral coordinates of this pixel at the
instant (t-1):

    search of the label i of the region to which the pixel belongs by
    reading the image of the labels S(t) at the position (X,Y);

    for this pixel, selection of its motion information (type of motion
    and value of the parameters) by reading M_i(t) for this label;

    computation of the displacement (Dx,Dy) of the pixel as a function
    of its coordinates, of the type of motion and the value of the
    parameters of its region (for example, in the case of a related
    (affine) motion, for which there are 6 parameters, one has:
    (Dx,Dy)=(a1+a2.X+a3.Y, a4+a5.X+a6.Y) if the motion parameters are
    expressed with respect to the global reference, or
    (Dx,Dy)=(a1+a2.x+a3.y, a4+a5.x+a6.y) if the motion parameters are
    expressed with respect to a local reference at their region), said
    displacement thus providing the possibility for this pixel of
    deducing its coordinates at (t-1): (X-Dx,Y-Dy) (if these coordinates
    are outside the image, one takes the coordinates of the closest
    point associated with the image). The coordinates found are not
    necessarily integral values, because the parameters are not, and
    thus an interpolation must be carried out for deducing the luminance
    at this point;

(b) computation of the luminance at these coordinates in P(t-1): the
luminance is interpolated in the present case by using a bicubic
monodimensional filter having a length 5 with a precision of 1/16th of a
pixel (the interpolation is effected horizontally and vertically with
the same filter, by means of an operation referred to as mirroring at
the edges of the image in the case of pixels at the edge of the image,
so as to obtain two luminance values, the mean value of which is
preserved).
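
The two prediction steps (a) and (b) can be sketched as follows for a related (affine) motion expressed in the global reference. This is only an illustrative sketch: the names `affine_displacement`, `bilinear` and `predict_luminance` are hypothetical, and plain bilinear interpolation is substituted for the bicubic filter of length 5 described above, whose coefficients are not given in the text.

```python
import math

def affine_displacement(params, X, Y):
    """(Dx, Dy) for a 6-parameter motion in the global reference."""
    a1, a2, a3, a4, a5, a6 = params
    return (a1 + a2 * X + a3 * Y, a4 + a5 * X + a6 * Y)

def bilinear(img, fx, fy):
    """Luminance at non-integral (fx, fy); bilinear stand-in (assumption)."""
    h, w = len(img), len(img[0])
    # Coordinates outside the image are moved to the closest image point
    fx = min(max(fx, 0.0), w - 1.0)
    fy = min(max(fy, 0.0), h - 1.0)
    x0, y0 = int(math.floor(fx)), int(math.floor(fy))
    x1, y1 = min(x0 + 1, w - 1), min(y0 + 1, h - 1)
    tx, ty = fx - x0, fy - y0
    top = (1 - tx) * img[y0][x0] + tx * img[y0][x1]
    bottom = (1 - tx) * img[y1][x0] + tx * img[y1][x1]
    return (1 - ty) * top + ty * bottom

def predict_luminance(prev, labels, motions, X, Y):
    """Predicted L(X, Y, t) from P(t-1), S(t) and the per-label motions."""
    label = labels[Y][X]                                 # read S(t) at (X, Y)
    Dx, Dy = affine_displacement(motions[label], X, Y)   # read the motion
    return bilinear(prev, X - Dx, Y - Dy)                # sample P(t-1)
```

Here `labels` plays the role of S(t) and `motions` maps each label to its six parameters; both data layouts are assumptions made for the example.
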

For the computations of the gradient, it is necessary to interpolate
their values, similarly as for the luminance. To ensure the coherence of
this operation with the interpolation used during prediction, the
filter used is derived from that used for the luminance, of length 5,
with a precision of 1/32nd of a pixel. The interpolation technique is
the same as for the luminance, except that the values resulting from the
horizontal and vertical filtering operations are used independently and
are thus not averaged (the same mirroring operation as for the
luminance may be performed).
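
The mirroring operation at the edges of the image, used with the filters of length 5, may be pictured with the short sketch below. The exact mirroring convention is not specified in the text; reflection about the edge sample without repeating it is assumed here, and the names `mirror_index` and `filter_row` as well as any filter taps passed in are hypothetical.

```python
def mirror_index(i, n):
    """Reflect index i into [0, n-1] about the edge samples (assumption)."""
    if n == 1:
        return 0
    period = 2 * (n - 1)
    i %= period
    return i if i < n else period - i

def filter_row(row, taps):
    """Apply an odd-length 1-D filter to a row, mirroring at both edges."""
    half = len(taps) // 2
    n = len(row)
    return [
        sum(t * row[mirror_index(x + k - half, n)] for k, t in enumerate(taps))
        for x in range(n)
    ]
```

The same routine can be applied to the columns, which gives the separate horizontal and vertical results mentioned above.
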
In theory, the matrix A must be constructed with the aid of a sum of
terms at all the points of the region. Certain points may be error
factors (association with a small object or parasitic motion, with an
uncovered zone . . . ). A simple restriction of selecting the points is
to preserve only the points (x,y) whose motion actually estimated is
such that S(x,y,t)=S(x-Dx,y-Dy,t-1). This restriction is the more
efficient as the segmentation is more coherent with the contents of the
image.
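
The restriction S(x,y,t)=S(x-Dx,y-Dy,t-1) amounts to a simple test on the two label images, sketched below. The function name `coherent_points`, the `displacement` callable and the rounding of the previous position to the nearest pixel are assumptions made for the example.

```python
def coherent_points(labels_cur, labels_prev, region_label, displacement):
    """Pixels of the region whose label is unchanged at their previous position.

    labels_cur, labels_prev: S(t) and S(t-1) as 2-D lists indexed [y][x].
    displacement: callable (x, y) -> (Dx, Dy) for the current estimate.
    """
    h, w = len(labels_cur), len(labels_cur[0])
    kept = []
    for y in range(h):
        for x in range(w):
            if labels_cur[y][x] != region_label:
                continue
            dx, dy = displacement(x, y)
            # Previous position, rounded to the nearest pixel (assumption)
            px, py = int(round(x - dx)), int(round(y - dy))
            if 0 <= px < w and 0 <= py < h and labels_prev[py][px] == region_label:
                kept.append((x, y))
    return kept
```

Only the points returned would then contribute to the sums that build the matrix A.
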

Once the matrix A is constructed, it is tested whether it is singular
or not. If it is not, an inversion by means of the method referred to as
the Householder method is performed. If it is, the motion is not
refined, the motion parameters (the information M_i^cv) remain as they
were at the start of this iteration, and one proceeds directly to the
selection of the better motion. The region is predicted by using the
information M_i^cv(t) which has been supplied in the refinement step,
and the information M_j(t) corresponding to every region j neighboring
i in S(t) and expressed in the local reference at i. The prediction
error in the region is computed each time. If all the errors are higher
than those corresponding to the previous information M_i^f(t), this
better motion remains unchanged. If not, the motion generating the
smallest prediction error is definitively retained, and the
corresponding new information is denoted M_i^f(t). As has been seen
above, a controlled propagation of the retained motion is possible. For
each region R_j adjacent to R_i, the prediction error in this region is
computed on the basis of parameters M_i^f(t), and M_j^f(t) is replaced
by M_i^f(t) if this error is smaller than the prediction error obtained
from M_j^f(t).
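
The selection of the better motion and the controlled propagation can be summarized by the sketch below. It is deliberately simplified: the names `select_best` and `propagate` are hypothetical, the prediction error is supplied by the caller, and the reconversion of a motion into the local reference of another region is left out (motions are assumed to be comparable as they are).

```python
def select_best(candidates, error_of, previous_best):
    """Keep previous_best unless a candidate gives a smaller prediction error."""
    best, best_err = previous_best, error_of(previous_best)
    for motion in candidates:
        err = error_of(motion)
        if err < best_err:
            best, best_err = motion, err
    return best

def propagate(best, neighbours, region, error_in):
    """Offer the retained motion of `region` to each adjacent region.

    best:       dict region -> retained motion, updated in place.
    neighbours: dict region -> list of adjacent regions.
    error_in:   callable (region, motion) -> prediction error in that region.
    """
    for j in neighbours[region]:
        if error_in(j, best[region]) < error_in(j, best[j]):
            best[j] = best[region]
    return best
```
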
The method described hereinbefore may be performed in a motion
estimation device 40 shown in FIG. 4, which in this case comprises a
first stage 41 for initializing the motion parameters of each region of
the current image, for performing the first initialization step 10 by
selecting the motion hypotheses and by choosing, for each region, that
motion for which the sum of the square values of the prediction errors
in the pixels of the region is minimal, followed by a second
intermediate processing stage 42 for performing the second step 20, and
a third stage 43 for refining the motion parameters for performing the
third step 30 by an approximated determination of the vector (Dx,Dy)
associated with each region. These second and third stages 42 and 43
are provided in a loop in accordance with an iterative process until
the end of this process as a function of a given criterion (in this
case, the intermediate processing operation comprises an isotropic
Gaussian filtering operation intended to accelerate the convergence of
this iterative process).
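
The isotropic Gaussian filtering performed in the intermediate processing stage can be sketched as a separable smoothing, applied first along the rows and then along the columns. The standard deviation, the kernel radius and the replication of edge samples are not specified in the text and are assumptions of this sketch, as are the names `gaussian_kernel` and `smooth`.

```python
import math

def gaussian_kernel(sigma, radius):
    """Normalized 1-D Gaussian kernel of length 2*radius + 1."""
    k = [math.exp(-(i * i) / (2.0 * sigma * sigma))
         for i in range(-radius, radius + 1)]
    total = sum(k)
    return [v / total for v in k]

def smooth(img, sigma=1.0, radius=2):
    """Separable Gaussian smoothing of a 2-D list indexed [y][x]."""
    k = gaussian_kernel(sigma, radius)
    h, w = len(img), len(img[0])

    def clamp(i, n):
        # Edge samples are replicated (assumption)
        return min(max(i, 0), n - 1)

    rows = [[sum(k[j + radius] * img[y][clamp(x + j, w)]
                 for j in range(-radius, radius + 1))
             for x in range(w)] for y in range(h)]
    return [[sum(k[j + radius] * rows[clamp(y + j, h)][x]
                 for j in range(-radius, radius + 1))
             for x in range(w)] for y in range(h)]
```

Applied to both P(t-1) and P(t), such a filter yields smoothed images in the sense of P'(t-1) and P'(t), while the partitions and motion information are left untouched.
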

A particularly important use of this motion estimation device is its
place in a system for encoding segmented images as described in
European Patent Application No. 95402389.1 filed on 25 Oct. 1995.
Without recalling all the details of the system described in this
Application, it is useful to define its essential characteristic. More
particularly, this system intended for encoding segmented images, or
partitions, corresponding to a sequence of initial images in which said
partitions are bounded by contours and thus by regions identified by
associated labels, comprises, before the
encoding itself, an assembly of means for generating, for each
successive partition referred to as current partition, a partition
referred to as optimal partition and defined in accordance with a
specific process based on said current partition.

This optimal partition is conventionally composed of juxtaposed
regions, but the source of these regions is original: they are regions
either extracted directly from a main partition, or projected
partition, determined by motion estimation and compensation of a
partition preceding the current partition, or regions extracted from
additional partitions created by merging or, in contrast, by
resegmentation of regions of said main partition. A kind of tree of
partitions at various levels of regions is thus constituted, providing
an assembly of regions from which, by means of a process of decisions,
the final set of regions constituting the optimal partition to be
encoded is chosen for each current partition.

To realize the possible merging of regions in this encoding system, the
system comprises at least a circuit 50 for merging the regions,
comprising, as illustrated in FIG. 5, a motion estimation device 51 and
a device 52 for proposing the merging of regions. The device 51, which
is of the type described above (device 40), receives two original
textured images P(t-1) and P(t), on the one hand, referred to as
previous image and current image, respectively, and the segmented image
or partition S(t-1) which corresponds to P(t-1), on the other hand, as
well as the projected partition denoted PJ(t), and computes the motion
parameters of this partition PJ(t) describing the motion of each region
of PJ(t) between the instants (t-1) and t.

When the motion parameters of PJ(t) are known, an estimation of the
cost of merging neighboring regions may be made in the device 52. For
each possible pair of adjacent regions, this merging cost is computed
and the pairs of regions presenting the smallest merging costs are
selected. For example, in accordance with a first merging criterion,
two adjacent regions are merged (into a single one) if the cost of
encoding the prediction error corresponding to the situation where
these two regions are now considered as a single one (i.e. when they
are motion-compensated by means of the same set of motion parameters)
is smaller than the cost of encoding the contour constituting the
boundary between the two regions. In accordance with another criterion,
which may be used if the encoding method is not known in advance, the
compensation error may be taken into account: the cost considered is
the variation of the mean quadratic (compensation) error when the two
regions concerned are separately compensated. A proposition for merging
these two regions is then possibly made, so that in the case of merging
new partitions PM_1(t-1) and PM_1(t) are henceforth available.

These two partitions PM_1(t-1) and PM_1(t) constitute, with the
original images P(t-1) and P(t), the signals which, for a possible
repetition of the motion estimation and merging proposition processes,
are transmitted to a second circuit 50b for merging the regions
(comprising a second motion estimation device 51b and a second merging
proposition device 52b), and so forth for other possible merging
levels. The assembly of these merging levels is then associated with
similar levels obtained by resegmentation for constituting the tree of
partitions in which the regions constituting the optimal partition are
chosen.

It should be noted that the invention is not limited to the embodiments
described, and that many variants based on these embodiments may be
conceived without passing beyond the scope of the invention.

For example, in the first step 10 of initializing the motion
parameters, it is possible to choose motion hypotheses other than those
taken into consideration, or to choose a larger number of them before
performing the selection, as initial motion of the region R_i
considered, of that one which generates the smallest prediction error.

In the course of the second step 20 of intermediate processing, it is
also possible to accelerate the process of estimating the parameters by
way of a sub-sampling operation, or, after having facilitated the
convergence of this process by way of the filtering operation proposed
hereinbefore, to prevent it from ending at a local minimum, by
effecting a hierarchic analysis of the images. The estimation principle
remains the same, but the differences of implementation (although this
is simplified due to the fact that one is now closer to the motion to
be estimated) are the following:

    initialization: for each region, the hypothesis "zero motion" or the
    hypothesis "field of vectors" (both described above) is tested,
    subsequently the motions of the parent region of the region
    considered and of the parent regions of the regions adjacent to this
    region considered are tested in the case where S(t) results from a
    new segmentation, or, in the case where S(t) results from a merging
    of regions, the motions of all the "children" regions of the region
    considered and the children regions of the regions adjacent to this
    region considered are tested;

    preprocessing: no filtering operation;

    refinement: performed on the non-filtered images.

During the third step 30 of refining the motion parameters, the
refinement computations are effected in a local reference in the region
considered whose center is the center of the horizontal rectangle
circumscribing the region and whose axes are parallel to the edges of
the image, while the units at x and y are the maximum dimensions of the
region in these directions, divided by a standardization coefficient.
FIG. 6 shows such an example of a local reference in the case where the
standardization coefficient is equal to 2 for a region of the shape
shown. An example of changing the reference for the horizontal
component of a related (affine) motion is clarified in this case for a
region having maximum horizontal and vertical dimensions Lx and Ly,
with an associated local reference whose center has coordinates (Xc,Yc)
with respect to the global reference related to the image and whose
horizontal motion component expressed locally is: Dx=a+b.x+c.y. If N is
the standardization coefficient, the parameters expressed in the global
reference related to the image will be A, B and C such that,
independent of X and Y in this region R:

$$a + b\,x + c\,y = a + b\,\frac{X - X_c}{L_x / N} + c\,\frac{Y - Y_c}{L_y / N} = A + B\,X + C\,Y$$

Other embodiments are also possible, for example, the bicubic filters
which are used for the prediction operation and have a fixed pitch, but
may also have a variable pitch, or, when the matrix A is constructed,
the possible weighting of contributions of different pixels of a region
so as to preclude the pixels which are subject to noise or to occlusion
phenomena and to give more weight to the points where the 
luminance variation is induced by the searched motion. It 
will also be noted that the previous description only corre- 
sponds to particular implementations of the invention and 
that, for example, an embodiment including particularly a 
microprocessor for executing the series of instructions cor- 
responding to the functions of certain or all elements in these 
previous embodiments is possible, all this in conformity 
with the invention. 
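
The change of reference described above for the horizontal component of a related (affine) motion follows directly from x=(X-Xc)/(Lx/N) and y=(Y-Yc)/(Ly/N). Identifying the terms in X and Y gives B=b.N/Lx, C=c.N/Ly and A=a-B.Xc-C.Yc; these closed forms are derived here from the stated definition of the local units and are not quoted from the text. A minimal sketch, with the hypothetical name `local_to_global`:

```python
def local_to_global(a, b, c, Xc, Yc, Lx, Ly, N):
    """Convert Dx = a + b*x + c*y (local) into Dx = A + B*X + C*Y (global)."""
    sx, sy = N / Lx, N / Ly   # local units per pixel along x and y
    B = b * sx
    C = c * sy
    A = a - B * Xc - C * Yc
    return A, B, C
```

For instance, with a=1, b=2, c=3, a center (Xc,Yc)=(10,20), dimensions Lx=8, Ly=4 and N=2, the pixel (X,Y)=(12,22) has local coordinates (0.5,1), and both expressions give Dx=5.
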
I claim: 

1. A method of estimating motion between images forming a sequence
P(t-n), P(t-n+1), . . . , P(t-2), P(t-1), P(t), . . . , corresponding
to a sequence S(t-n), S(t-n+1), . . . , S(t-2), S(t-1), S(t), . . . ,
of segmented images, or partitions, composed of I regions R_i
identified by labels, said method successively comprising, for
supplying, per region R_i and in the form of a motion vector defined by
motion parameters, information M_i(t) which is representative of the
motion of the current image P(t) with respect to a previous image
P(t-1), the following three operations each performed for each region
of the current image:

(1) a first step of initializing the motion parameters of each region
R_i of the current image P(t), the initialized values of the motion
parameters being a function of the images before segmentation P(t-1)
and P(t), the segmented images S(t-1) and S(t), and motion information
M_i(t-1) estimated for the previous current image P(t-1) in a previous
performance of the method;

(2) a second step of intermediate processing of the images on which the
estimation of the motion and of the parameters of this motion, is
performed, and a third step of refining the motion parameters for the
definitive determination of said parameters in the form of a vector
(Dx, Dy) for all the pixels of the image of each of said regions R_i in
such a way that for each coordinate point (x,y) of the region,
L(x,y,t)=L(x-Dx, y-Dy, t-1), L(.) designating the luminance or a video
signal which is more complex, such as a combination of chrominance and
luminance signals, and Dx, Dy being polynomials the degree of which is
related to the type of motion of the region;

(3) the iterative repetition of said second and third steps of
intermediate processing and refinement until the end of this iterative
process as a function of at least a given criterion so as to finally
obtain the I motion information M_i(t).

2. A method as claimed in claim 1, in which said first step of
initializing the motion parameters comprises the following sub-steps:

(A) a first sub-step of defining motion hypotheses for each region R_i
considered;

(B) a second sub-step of predicting, on the basis of the image P(t-1),
the region corresponding to each region R_i in the image P(t), taking
each of the previous motion hypotheses into account, and of computing
the corresponding prediction errors;

(C) a third sub-step of comparing said computed prediction errors for
each region R_i considered and of selecting, as an initial motion of
this region, that motion hypothesis which corresponds to the smallest
prediction error.

3. A method as claimed in claim 2, in which said second 
step of intermediate processing comprises a filtering pre- 
processing operation by means of an isotropic Gaussian 
filter.

4. A method as claimed in claim 2, wherein said three operations of
initialization, intermediate processing and refinement are followed by
an additional operation of controlled propagation to the regions
neighboring each region R_i in accordance with which the following
sub-steps are repeated for each region R_i:

(a) a new sub-step of predicting, on the basis of the image P(t-1),
each of said neighboring regions in the image P(t), taking a motion
hypothesis in these regions into account which corresponds to that
associated with the motion parameters resulting from said definitive
determination, and of computing the corresponding additional prediction
error;

(b) a sub-step of comparing said additional prediction error and the
prediction error associated with the motion hypothesis retained as the
initial motion of the neighboring region considered, and of selecting,
as motion of each of said neighboring regions, that one to which the
smallest of these two prediction errors corresponds.

5. A method of estimating motion between images forming a sequence
P(t-n), P(t-n+1), . . . , P(t-2), P(t-1), P(t), . . . , corresponding
to a sequence S(t-n), S(t-n+1), . . . , S(t-2), S(t-1), S(t), . . . ,
of segmented images, or partitions, composed of I regions R_i
identified by labels, said method successively comprising, for
supplying, per region R_i and in the form of a motion vector defined by
motion parameters, information M_i(t) which is representative of the
motion of the current image P(t) with respect to a previous image
P(t-1), the following three operations each performed for each region
of the current image:

(1) a first step of initializing the motion parameters of each region
R_i of the current image P(t) as a function of the images before
segmentation P(t-1) and P(t), segmented images S(t-1) and S(t), and
motion information M_i(t-1) estimated for the previous current image
P(t-1) in a previous performance of the method; wherein said first step
comprises the sub-steps of (a) defining motion hypotheses for each
region R_i considered; (b) predicting, on the basis of the image
P(t-1), the region corresponding to each region R_i in the image P(t),
taking each of the previous motion hypotheses into account, and of
computing the corresponding prediction errors; and (c) comparing said
computed prediction errors for each region R_i considered and of
selecting, as an initial motion of this region, that motion hypothesis
which corresponds to the smallest prediction error;

(2) a second step of intermediate processing of the images on which the
estimation of the motion and of the parameters of this motion, is
performed, and a third step of refining the motion parameters for the
definitive determination of said parameters in the form of a vector
(Dx, Dy) for all the pixels of the image of each of said regions R_i in
such a way that for each coordinate point (x,y) of the region,
L(x,y,t)=L(x-Dx, y-Dy, t-1), L(.) designating the luminance or a video
signal which is more complex, such as a combination of chrominance and
luminance signals, and Dx, Dy being polynomials the degree of which is
related to the type of motion of the region;

(3) the iterative repetition of said second and third steps of
intermediate processing and refinement until the end of this iterative
process as a function of at least a given criterion so as to finally
obtain the I motion information M_i(t), wherein for the determination
of the vector Dx,Dy associated with each region R_i considered, said
third refinement step comprises an operation of minimizing the sum of
the square values of the prediction errors for the pixels of the image
of said region R_i or a sub-set of these pixels, this sum being written
as:

$$\sum_{(x,y)} \bigl( L(x,y,t) - L(x-Dx,\,y-Dy,\,t-1) \bigr)^2 .$$

6. A method as claimed in claim 5, wherein said minimization is
effected by means of the Gauss-Newton approximation method by effecting
a first-order limited development of said sum after substituting Dx and
Dy for (Dx0+dx) and (Dy0+dy), respectively, by making equal to 0 the
derivatives of said limited development with respect to each
coefficient of dx and dy, and by retaining, as variation values of the
motion parameters leading to the smallest quadratic error, the
solutions of the set of n equations with n unknown quantities obtained
subsequent to these operations.

7. A device for estimating motion between images forming a sequence
P(t-n), P(t-n+1), . . . , P(t-2), P(t-1), P(t), . . . , corresponding
to a sequence S(t-n), S(t-n+1), . . . , S(t-2), S(t-1), S(t), . . . ,
of segmented images, or partitions, composed of I regions R_i
identified by labels, and for supplying, per region R_i and in the form
of a motion vector defined by motion parameters, information M_i(t)
which is representative of the motion of the current image P(t) with
respect to a previous image P(t-1), the device comprising:

a first stage for initializing the motion parameters of each region of
the current image, the initial values of the motion parameters being
selected based on the criterion of motion hypotheses and for each
region, that motion for which the sum of the square values of the
prediction errors in the pixels of the image of the region is minimal,
a second stage for intermediate processing, and a third stage for
refining the motion parameters by an approximated determination of the
vector (Dx,Dy) associated with each region, and means for repeating the
intermediate processing and refining process for each region until at
least one preselected criterion is met.

8. A device as claimed in claim 7, characterized in that said
intermediate processing operation comprises an isotropic Gaussian
filtering operation intended to accelerate the convergence of said
iterative process.

9. A device as claimed in claim 7, characterized in that said motion
hypotheses amount to four, namely: zero motion, motion having a
translation equivalent to that of the center of gravity of the region,
motion evaluated as the best between this label and its neighbors in
the previous partition, and motion estimated by implementing a
block-matching method referred to as BMA.

10. A system of encoding segmented images, or partitions, corresponding
to a sequence of initial images in which said partitions are bounded by
contours and regions identified by associated labels, said system
comprising means for generating, for each successive partition referred
to as current partition, encoded signals corresponding to a partition
referred to as optimal partition and composed either of regions
extracted from a partition referred to as main partition and determined
directly by motion estimation and compensation of a partition preceding
said current partition, or of regions extracted from additional
partitions created by merging or, in contrast, by resegmentation of
regions of said main partition, characterized in that, in view of said
possible merging, the system comprises at least a circuit for merging
the regions, comprising a motion estimation device as claimed in claim
7 and a device for proposing the merging of regions concerned.

11. An encoding system as claimed in claim 10, characterized in that it
comprises several region-merging circuits in series for constituting as
many levels of distinct partition levels referred to as upper levels,
said upper levels being associated with said main partition and with
lower levels determined in a similar manner by way of successive
resegmentations for constituting a tree of partitions providing a
complete assembly of regions from which the final set of regions
constituting said optimal partition to be encoded is chosen for each
current partition.

*****